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Abstract 

Neural codes allow the brain to represent, process, and store information about the world. Combi¬ 
natorial codes, comprised of binary patterns of neural activity, encode information via the collective 
behavior of populations of neurons. A code is called convex if its codewords correspond to regions 
dehned by an arrangement of convex open sets in Euclidean space. Convex codes have been observed 
experimentally in many brain areas, including sensory cortices and the hippocampus, where neurons 
exhibit convex receptive helds. What makes a neural code convex? That is, how can we tell from 
the intrinsic structure of a code if there exists a corresponding arrangement of convex open sets? In 
this work, we provide a complete characterization of local obstructions to convexity. This motivates 
us to dehne max intersection-complete codes, a family guaranteed to have no local obstructions. We 
then show how our characterization enables one to use free resolutions of Stanley-Reisner ideals in 
order to detect violations of convexity. Taken together, these results provide a signihcant advance in 
understanding the intrinsic combinatorial properties of convex codes. 
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1 Introduction 

Cracking the neural code is one of the central challenges of neuroscience. Typically, this has been 
understood as hnding the relationship between the activity of neurons and the stimuli they represent. 
To uncover the principles of neural coding, however, it is not enough to describe the various mappings 
between stimulus and response. One must also understand the intrinsic structure of neural codes, 
independently of what is being encoded [1] . 

Here we focus our attention on convex codes, which are comprised of activity patterns for neurons 
with classical receptive helds. A receptive field Ui is the subset of stimuli that induces neuron i to 
respond. Often, Ui is a convex subset of some Euclidean space (see Eigure[^. A collection of convex sets 
Ui,... ,Un naturally associates to each point x G a binary response pattern, ci • • • c„ G {0,1}”, 
where Cj = 1 if x G C/j, and Cj = 0 otherwise. The set of all such response patterns is a convex code. 

Convex codes have been observed experimentally in many brain areas, including sensory cortices and 
the hippocampus. Hubei and Wiesel’s discovery in 1959 of orientation-tuned neurons in the primary 
visual cortex was perhaps the hrst example of convex coding in the brain [2]. This was followed by 
O’Keefe’s discovery of hippocampal place cells in 1971 [3], showing that convex codes are also used in 
the brain’s representation of space. Both discoveries were groundbreaking for neuroscience, and were 
later recognized with Nobel Prizes in 1981 [1] and 2014 [5], respectively. 
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Our motivating examples of convex codes are, in fact, hippocampal place cell codes. A place cell 
is a neuron that acts as a position sensor, exhibiting a high firing rate when the animal’s location lies 
inside the cell’s preferred region of the environment - its place field. Figure displays the place fields 
of four place cells recorded while a rat explored a two-dimensional environment. Each place held is an 
approximately convex subset of M^. Taken together, the set of all neural response patterns that can 
arise in a population of place cells comprise a convex code for the animal’s position in space. Note that 
in the neuroscience literature, convex receptive helds are typically referred to as unimodal, emphasizing 
the presence of just one “hot spot” in a stimulus-response heat map such as those depicted in Figure 


place field of neuron #1 


O 


place field of neuron #2 place field of neuron #3 



place field of neuron #4 
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Figure 1: Place helds of four CAl pyramidal neurons (place cells) in rat hippocampus, recorded while 
the animal explored a 1.5m x 1.5m square box environment. Red areas correspond to regions of 
space where the corresponding place cells exhibited high bring rates, while dark blue denotes near-zero 
activity. Place helds were computed from data provided by the Pastalkova lab, as described in [ 6 ]. 

Despite their relevance to neuroscience, the mathematical theory of convex codes was initiated only 
recently mm- In particular, it is not clear what are the intrinsic combinatorial signatures of convex and 
non-convex codes. Identifying such features will enable us to infer coding properties from population 
recordings of neural activity, without needing a priori knowledge of the stimuli being encoded. This 
may be particularly important for studying systems like olfaction, where the underlying “olfactory 
space” is potentially high-dimensional and poorly understood. Having intrinsic signatures of convex 
codes is a critical step towards understanding whether something like convex coding may be going on 
in such areas. Understanding the structure of convex codes is also essential to uncovering the basic 
principles of how neural networks are organized in order to learn, store, and process information. 


1.1 Convex codes 

By a neural code, or simply a code, on n neurons we mean a collection of binary strings C C {0,1}”. 
The elements of a code are called codewords. We interpret each binary digit as the “on” or “off” 
state of a neuron, and consider 0/1 strings of length n and subsets of [n] = { 1 ,..., n} interchangeably. 
For example, 1101 and 0100 are also denoted {1,2,4} and {2}, respectively. We will always assume 
00 • • • 0 G C (i.e., 0 G C); this assumption simplihes notation in various places but does not alter the 
core results. 

Let A be a topological space, and consider a collection of open sets U = {Ui,... ,Un}, where 
Ui C X. Any such U dehnes a code. 


C{U) < aC 


[n] I Her \ Uj 


je[n]\a 


( 1 ) 


where Ua- Hi^a-Ui for a C [n]. In other words, each codeword a in C(Zd) corresponds to the portion 
of Ucr that is not covered by other sets. In particular, C(U) is not the same as the nerve of the cover. 


MiU) (see Section 3.2). By convention, C /0 = X, and so 0 G CiU). 
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For any code C, there always exists an open cover U such that C = C{U) [U Lemma 2.1]. However, 
it may be impossible to choose the UiS to all be convex. We thus have the following definitions, which 
were first introduced in [1]: 

Definition 1.1. Let C be a binary code on n neurons. If there exists a collection of open sets U = 
{Ui ,..., Un} such that C = C{U) and the UiS are all convex subsets of then we say that C is a 
convex code. The smallest d such that this is possible is called the minimal embedding dimension, d{C). 


Note that the definition of a convex code is extrinsic: a code is convex if it can be realized by an 
arrangement of convex open sets in some Euclidean space. How can we characterize convex codes 
intrinsically! If a code is not convex, how can we prove this? If a code is convex, what is the minimal 
dimension needed for the corresponding open sets? 

In this work, we tackle these questions by building on mathematical ideas from [T] and jS]. In 
particular, we study local obstructions to convexity, a notion first introduced in [8]. Our main result 
is Theorem 1.3, which provides a complete characterization of codes with no local obstructions. In 


Section we present a series of examples that illustrate the ideas summarized in Section [L3| Sections 
and 1^ are devoted to additional background and technical results needed for the proof of Theorem |1.3[ 
Finally, in Section we show how tools from combinatorial commutative algebra, such as Hochster’s 
formula, can be used to determine that a code is not convex. 


1.2 Preliminaries 

Simplicial complexes. A simplicial complex A on n vertices is a nonempty collection of subsets of 
[n] that is closed under inclusion. In other words, if cr G A and t (Z a, then r G A. The elements of a 
simplicial complex are called simplices or faces. The dimension of a face, u G A, is defined to be |(t| — 1. 
The dimension of a simplicial complex A is equal to the dimension of its largest face: maXo-gA |<7| — 1- 
If A consists of all 2” subsets of [n], then it is the full simplex of dimension n — 1. The hollow simplex 
contains all proper subsets of [n], but not [n], and thus has dimension n — 2. 

Faces of a simplicial complex that are maximal under inclusion are referred to as facets. If we 
consider the facets together with all their intersections, we obtain the set 

J^n(A) 1Pi I Pi is a facet of A for each i = 1,..., fej' U {0}. 

We refer to the elements of J^n(^) as max intersections of A. The empty set is added so that 
can be regarded as a code, consistent with our convention that the all-zeros word is always included. 

Restrictions and links are standard constructions from simplicial complexes. The restriction of A 
to a is the simplicial complex 

A|o- {w G A I w C O'} . 

For any u G A, the link of a inside A is 

Lko-(A) =^{ti;GA|(Tna; = 0 and a U a; G A} . 

Note that it is more common to write LkA(i7) or linkA(o'), instead of Lk(j(A) (see, for example, [9]). 
However, because we will often fix a and consider its link inside different simplicial complexes, such as 
A|o-ut-, it is more convenient to put a in the subscript. 

The simplicial complex of a code. To any code C, we can associate a simplicial complex A(C) by 
simply including all snbsets of codewords: 

A(C) {(T C [n] I (T C c for some c G C} . 


3 



A(C) is called the simplicial complex of the code, and is the smallest simplicial complex that contains all 
elements of C. The facets of A(C) correspond to the codewords in C that are maximal under inclusion: 
these are the maximal codewords. 


Local obstructions to convexity. At first glance, it may seem that all codes should be convex, 
since the convex sets Ui can be chosen to reside in arbitrarily high dimensions. This is not the case, 
however, as non-convex codes arise for as few as n = 3 neurons [I]. To understand what can go wrong, 
consider a code with the following property: any codeword with a 1 in the first position also has a 
1 in the second or third position, but no codeword has a 1 in all three positions. This implies that 
any corresponding cover U must have Ui C U 2 U C/3, but C 7 i H C/2 n C/3 = 0. The result is that Ui is a 
disjoint union of two nonempty open sets, Ui n C/2 and C/i H C/3, and is hence disconnected. Since all 
convex sets are connected, we conclude that our code cannot be convex. The contradiction stems from 
a topological inconsistency that emerges if the code is assumed to be convex. 

This type of topological obstruction to convexity generalizes to a family of local obstructions, first 
introduced in [8] . We define local obstructions precisely in Section There we also show that a code 
with one or more local obstructions cannot be convex: 


Lemma 1.2. If C is a convex code, then C has no local obstructions. 


This fact was first observed in [8], using slightly different language. The converse, unfortunately, is not 
true. See Example 2.3 for a counterexample that first appeared in [TO] . 


1.3 Summary of main results 

To prove that a neural code is convex, it suffices to exhibit a convex realization. That is, it suffices to 
find a set of convex open sets U = {C/i,.. .,[/„} such that C = C(/l). Our strategy for proving that 
a code is not convex is to show that it has a local obstruction to convexity. Which codes have local 
obstructions? 

Perhaps surprisingly, the question of whether or not a given code C has a local obstruction can be 
reduced to the question of whether or not it contains a certain minimal code, Cmin(A), which depends 
only on the simplicial complex A = A(C). This is our main result: 

Theorem 1.3 (Characterization of local obstructions). For each simplicial complex A, there is a 
unique minimal code Cmin(A) with the following properties: 

(i) the simplicial complex of C m^n fA) is A, and 

(ii) for any code C with simplicial complex A, C has no local obstructions if and only if C 5 Cniin(A). 
Moreover, Cmin(A) depends only on the topology of the links of A: 

Cmin(A) = {(T G A I LkCT(A) is non-contractible} U {0}. 


We will regard the elements of Cmin(A) as “mandatory” codewords with respect to convexity, 
because they must all be included in order for a code C with A(C) = A to be convex. From the above 
description of Cmin(A), we can prove the following lemma. 

Lemma 1.4. Cniin(A) C J^pi(A). That is, each nonempty element of Cmin(/^) is an intersection of facets 
of A. 


Our proofs of Theorem 1.3 and Lemma |1.4| are given in Section 4.2 Unfortunately, finding all 
elements of Cmin(A) for arbitrary A is, in general, undecidable (see Section]^. Nevertheless, we can 
algorithmically compute a subset, /V4i^(A), of “homologically-detectable” mandatory codewords. In 
Sectionj^we show how to compute using machinery from combinatorial commutative algebra. 
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Lemma 1.4 also tells us that every element of Cmin(^) must be an intersection of facets of A ~ that is, 
an element of -Fn(A). We thus have the inclusions: 

Mh(A) C Ci„in(A) C ^n(A), (2) 


where both and -Fn(A) are straightforwardly computable. Note that if = -^n(A), 

then we can conclude that Cmin(A) = Moreover, if C A then C A Cmin(A), and thus C 

has no local obstructions (and is potentially convex). This motivates the following definition: 

Definition 1.5. A neural code C is max D-complete (or max intersection-complete) if C A.Fn(A(C)). 

We therefore have a simple combinatorial condition for a code that guarantees it has no local 
obstructions: 


Corollary 1.6. If a neural code C is max H-complete, then C has no local obstructions. 

For n < 4, the convex codes are precisely those codes that are max H-complete. 

Proposition 1.7. Let C be a code on n < 4 neurons. Then C is convex if and only if C is max 
H-complete. 


This is shown in Supplementary Text SI, where we provide a complete classification of convex codes 
on n = 4 neurons. Proposition 1.7 does not extend to n > 4, however, since beginning in n = 5 there 
are convex codes that are not max H-complete (see Example 2.2). This raises the question: are there 
max H-complete codes that are not convex? In a previous version of this paper, we conjectured that 
all max H-complete codes are convex HU. This conjecture has recently been proven [TU, using ideas 
similar to what we illustrate in Example |2.4[ 


Proposition 1.8 ( [T^ Theorem 4.4]). If C is max H-complete, then C is convex. 


Einally, in Supplementary Text S3 we present some straightforward bounds on the minimal em¬ 
bedding dimension d{C), obtained using results about d-representability of the associated simplicial 
complex A(C). In particular, we find bounds from Kelly’s theorem and the Eractional Kelly theo¬ 
rem. Unfortunately, these results all stem from A(C). In our classification of convex codes for n < 4, 
however, it is clear that the presence or absence of specific codewords can affect d{C), even if A(C) 
remains unchanged (see Tablej^in Supplementary Text SI). The problem of how to use this additional 
information about a code in order to improve the bounds on d{C) remains wide open. 


2 Examples 

Our first example depicts a convex code with minimal embedding dimension d{C) = 2 . 

Example 2.1. Consider the open cover U illustrated in Figure]^. The corresponding code, C = C{U), 
has 10 codewords. C is a convex code, by construction, and it is easy to check that d{C) = 2 . The 
simplicial complex A(C) (see Figure]^) loses some of the information about the cover U that is present 
in C. In particular, t /2 C [/i U U 3 and 1/2 H 1/4 C U 3 is reflected in C, but not in A(C). Note that we can 
infer U 2 UiD U 3 directly from the code, because any codeword with neuron 2 “on” also has neuron 
1 or 3 “on.” 

Note that the convex code in Example |2.1| is also max H-complete, as guaranteed by Proposition [L^ 
The next example shows that this proposition does not hold for n > 5. 

Example 2.2. The simplicial complex A shown in Figure has facets 123, 134, and 145. Their 
intersections yield the faces 1, 13, and 14, so that /(-((A) = {123,134,145,13,14,1, 0}. For this A, we 
can compute the minimal code with no local obstructions, Cmin(A) = {123,134,145,13,14,0}, as in 
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Figure 2: (a) An arrangement lA = {Ui,U 2 -,U^,Ui} of convex open sets. Black dots mark regions 
corresponding to distinct codewords in C = CiU). From left to right, the codewords are 0000, 1000, 
1100, 1010, 1110, 0110, 0010, 0111, 0011, and 0001. (b) The simplicial complex A(C) for the code C 
defined in (a). The two facets, 123 and 234, correspond to the two maximal codewords, 1110 and 0111, 
respectively. This simplicial complex is also equal to the nerve of the cover N{U) (see Section 3.2). 



Figure 3: (a) A simplicial complex A on n = 5 vertices. The vertex 1 is an intersection of facets, but 
is not contained in the code C of Example 2.2 (b) The link Lki(A) (see Section 3.3). (c) A convex 


realization of the code C. The set Ui corresponding to neuron 1 (shaded) is completely covered by the 
other sets U 2 , ■ ■ ■ ,11^, consistent with the fact that 1 ^ C. 


Theorem 1.3 Note that the element 1 G -Fn(A) is not present in Cmin(A). Now consider the code 


C = A \ {!}. Clearly, this code has simplicial complex A(C) = A : it has a codeword for each face of A, 
except the vertex 1 (see Figure]^). By Theorem 1.3, C has no local obstructions because C 3 Cmin(A). 
However, C is not max H-complete because Jri(A) ^ C. Nevertheless, C is convex. A convex realization 
is shown in Figure [^. 

The absence of local obstructions is a necessary condition for convexity. Unfortunately, it is not 
sufficient: the following example shows a code with no local obstructions that is not convex. 


Example 2.3 ([lO]). Consider the code C = {2345,123,134,145,13,14, 23, 34,45, 3, 4, 0}. The simpli¬ 
cial complex of this code, A = A(C), has facets {2345,123,134,145}. It is straightforward to show that 
C = Cmin(A), and thus C has no local obstructions. Despite this, it was shown in [10] using geometric 
arguments that C is not convex. Note that this code is not max C-complete (the max intersection 
123 n 134 n 145 = 1 is not in C). 


The next example illustrates how a code with a single maximal codeword can be embedded in M^. 
This basic construction is used repeatedly in our proof of Proposition 1 1. 7| (see Supplementary Text SI), 
and inspired aspects of the proof of Proposition 1.8 given in |12j . 


Example 2.4. Consider the code C = {1111,1011,1101,1100,0011,0010,0001,0000}, with unique 
maximal codeword 1111. Eigure depicts the construction of a convex realization in All regions 
corresponding to codewords are subsets of a disk in Eor each z = 1,..., 4, the convex set Ui is the 
union of all regions whose corresponding codewords have a 1 in the position. Eor example, Ui is 
the union of the four regions corresponding to codewords 1111, 1011, 1101, and 1100. 

The above construction can be generalized to any code with a unique maximal codeword. 
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Figure 4: A convex realization in of the code in Example 2.4, (Left) Each non-maximal codeword 


is assigned a region outside the polygon, but inside the disk. (Right) For each neuron i, the convex set 
Ui is the union of all regions corresponding to codewords with a 1 in the position. 


Lemma 2.5. Let C be a code with a unique maximal codeword. Then C is convex, and d{C) < 2. 

Proof. Let p G C be the unique maximal codeword, and let m = |C| — 2 be the number of non-maximal 
codewords, excluding the all-zeros word. Inscribe a regular open m-gon P in an open disk, so that 
there are m sectors surrounding P, as in Figure]^ (If m < 3, let P be an open triangle.) Assign each 
non-maximal codeword (excluding 00 • • • 0) to a distinct sector inside the disk but outside of P, and 
assign the maximal codeword p to P. Next, for each i G p let Ui be the union of P and all sectors 
whose corresponding codewords have a 1 in the position, together with their common boundaries 
with P. For j G [n] \ p, set Uj = 0. Note that each Ut is open and convex, and C = C{{Ui}). □ 

Lemma |2.5| can easily be generalized to any code whose maximal codewords are non-overlapping 
(that is, having disjoint supports). In this case, each nonzero codeword is contained in a unique facet 
of A(C), and the facets thus yield a partition of the code. We can repeat the above construction in 
parallel for each part, obtaining the same dimension bound. 

Proposition 2.6. Let C be a code with non-overlapping maximal codewords (i.e., the facets of A(C) 
are disjoint). Then C is convex and d{C) < 2. 

3 Local obstructions to convexity 

For any simplicial complex A, there exists a convex cover U in a high-enough dimensional space 
such that A can be realized as A(C(^)) |13) . For this reason, the simplicial complex A(C) alone is 
not sufficient to determine whether or not C is convex. Obstructions to convexity must emerge from 
information in the code that goes beyond what is reflected in A(C). As was shown in [ 1 ], this additional 
information is precisely the receptive field relationships, which we turn to now. 

3.1 Receptive field relationships 

For a code C on n neurons, let U = {Ui,... ,Un} be any collection of open sets such that C = C{ly(), 
and recall that = n^e.Ui. 

Definition 3.1. A receptive field relationship (RF relationship) of C is a pair (u, r) corresponding to 
the set containment 

iGr 

where u / 0, cr n r = 0, and UUi lb for all i G r. 
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If r = 0, then the relationship (o', 0) simply states that = 0. Note that relationships of the form 
(cr, 0) reproduce the information in A(C), while those of the form (cr, r) for r 7 ^ 0 reflect additional 
structure in C that goes beyond the simplicial complex. A minimal RF relationship is one such that 
no single neuron can be removed from cr or r without destroying the containment. 

It is important to note that RF relationships are independent of the choice of open sets lA (see 
Lemma 4.2 of [T]). Hence we denote the set of all RF relationships {(cr, r)} for a given code C as simply 
RF(C). In m, it was shown that one can compute RF(C) algebraically, using an associated ideal Iq- 


Example 3.2 (Example |2.1| continued). The code C = C{U) from Example 2.1 has RF relationships 
RF(C) = {({1,4},0),({1,2,4},0),({1,3,4},0),({1,2,3,4},0),({2},{1,3}),({2},{1,3,4}),({2,4},{3})}. 
Of these, the pairs ({1,4}, 0), ({ 2 }, { 1 , 3}), and ({ 2 ,4}, {3}), corresponding to [/int /4 = 0, 1/2 C t/iUt/ 3 , 
and 1 / 201 / 4 ^ t /3 respectively, are the minimal RF relationships. 


The following lemma illustrates a simple case where RF relationships can be used to show that a 
code cannot have a convex realization. (This is a special case of Lemma 3.6 below.) 


Lemma 3.3. Let C = C{IA). If C has RF relationships Ui Li Uj and I/o- n t/j n Uj = 0 for some 

cr C [n] and distinct i,j ^ cr, then C is not a convex code. 


Proof. By assumption, {(cr, {i,j}), (cr U {i, j},0)} C RF(C). It follows that the sets Vi = LiUi % 
and Vj = Ua P Uj / 0 are disjoint open sets that each intersect U^j, and Uo- C V) U V)-. We can thus 
conclude that is disconnected in any open cover U such that C = CiU). This implies that C cannot 
have a convex realization, because if the UiS were all convex then would be convex, contradicting 
the fact that it is disconnected. □ 


The above proof relies on the observation that must be convex in any convex realization lA, but 
the properties of the code imply that Ua is covered by a collection of open sets whose topology does 
not match that of a convex set. This topological inconsistency between a set and its cover is, at its 
core, a contradiction arising from the Nerve lemma, which we discuss next. 


3.2 The Nerve lemma 

The nerve of an open cover lA = {Ui,... ,Un} is the simplicial complex 

M{U) {a C [n] I / 0}. 

In fact, NilA) = A(C(^)), so the nerve can be recovered directly from the code C{IA). The Nerve lemma 
tells us that M(U) carries a surprising amount of topological information about the underlying space 
covered by lA, provided lA is a good cover. Recall that a good cover is a collection of open sets {Ui} 
where every non-empty intersection, = flieo- is contractible^ 

Lemma 3.4 (Nerve lemma). If ZY is a good cover, then Ui is homotopy-equivalent to J\f{lA). In 
particular, (JILi MifA) have exactly the same homology groups. 

This result is well known, and can be obtained as a direct consequence of |14l Corollary 4G.3]. 

Now observe that an open cover comprised of convex sets is always a good cover, because the 
intersection of convex sets is convex, and hence contractible. For example, if C = CilA) for a convex 
cover lA, then A(C) must match the homotopy type of (JILi This fact was previously exploited to 
extract topological information about the represented space from hippocampal place cell activity [T5] . 
The Nerve lemma is also key to our notion of local obstructions, which we turn to next. 


set is contractible if it is homotopy-equivalent to a point M- 





3.3 Local obstructions 

Local obstructions arise when a code contains a RF relationship (fx, r), so that C Ui, but the 
nerve of the corresponding cover of by the restricted sets {Ui n Ua}i^T is not contractible. By the 
Nerve lemma, if the UiS are all convex then ^[{{Ua H Ui}i£r) must have the same homotopy type as 
Ua, which is contractible. If J\f{{Ua n Ui}i^r) fails to be contractible, we can conclude that the C/jS can 
not all be convex. 

Now, observe that the nerve of the restricted cover Af{{Ua n Ui}i^r) is related to the nerve of the 
original cover N{U) as follows; 

N{{Ua n Ui}i^r) = {uj G Mihi) \ a uj = %, a U io G and oj C r}. 

In fact, letting A = MiU) and considering the restricted complex A|o-ur, we recognize that the right- 
hand side above is precisely the link, 


N{{U, n Ui\i^r ) = Lk,(A| crUr ) • 


We can now define a local obstruction to convexity. 


Definition 3.5. Let (u, r) G RF(C), and let A = A(C). We say that (u, r) is a local obstruction of C 
if r / 0 and Lko-(A|o-ur) is not contractible. 


Local obstructions are thus detected via non-contractible links of the form Lko-(A|o-UT)! where 
(cr, r) G RF(C). Figure displays all possible links that can arise for |t| < 4. Non-contractible links 
are highlighted in red. Note also that r 7 ^ 0 implies a ^ C and IJa 7 ^ 0, as the definition of a RF 
relationship requires that D Ui 9 for all i G t. Any local obstruction (cr, r) must therefore have 
fj G A(C) \ C and Lk(j(A|o-UT) nonempty. 

The arguments leading up to the definition of local obstruction imply the following simple conse¬ 
quence of the Nerve lemma, which was previously observed in [H]. 


Lemma 3.6 (Lemma 1.2). If C has a local obstruction, then C is not a convex code. 


In general, the question of whether or not a given simplicial complex is contractible is undecidable 
m-, however, in some cases it is easy to see that all relevant links will be contractible. This yields a 
simple condition on RF relationships that guarantees that a code has no local obstructions. 


Lemma 3.7. Let C = C{U). If for each (cr, r) G RF(C) we have U Ur ^ 0, then C has no local 
obstructions. 


Proof. Let A = A(C). U^ UUr ^ implies Lko-(A|o-ur) is the full simplex on the vertex set r, which is 
contractible. If this is true for every RF relationship, then none can give rise to a local obstruction. □ 

For example, if 11 • • • 1 G C, then Lq. H tA / 0 for any pair rr, r C [n], so C has no local obstructions. 


4 Characterizing local obstructions via mandatory codewords 


From the definition of local obstruction, it seems that in order to show that a code has no local obstruc¬ 
tions one would need to check the contractibility of all links of the form Lko-(A|o-UT) corresponding to all 
pairs (cr, r) G RF(C). We shall see in this section that in fact we only need to check for contractibility 
of links inside the full complex A - that is, links of the form Lk(j(A). This is key to obtaining a list of 
mandatory codewords, Cmin(A), that depends only on A, and not on any further details of the code. 

In Section 4.1 we prove some important lemmas about links, and then use them in Section 4.2 to 


prove Theorem 1.3 
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Figure 5: All simplicial complexes on up to n = 4 vertices, up to permutation equivalence. These can each arise 
as links of the form Lkcr(A|crUT) for |r| < 4. Red labels correspond to non-contractible complexes. Note that 
L13 is the only simplicial complex on n < 4 vertices that is contractible but not a cone. 
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4.1 Link lemmas 


In what follows, the notation 


cone^(A) = {{u} U a; | cj G A} U A 


denotes the cone of v over A, where u is a new vertex not contained in A. Any simplicial complex that 
is a cone over a sub-complex, so that A = conet,(A'), is automatically contractible. In Figure]^ the 
only contractible link that is not a cone is L13. This is the same link that appeared in Figure of 


Example 2.2 


Lemma 4.1. Let A be a simplicial complex on [n], cr G A, and v G [n] such that v ^ a and cjU{u} G A. 
Then Lk^u{^}(A) C Lk,^(A|[„]\{^}), and 


Lk^(A) = Lk^(A|[„]\{^}) U cone„(Lk^uw(^))- 


Proof. The proof follows from the definition of the link. First, observe that 


Lko-u{j,}(A) = {oj <Z [n\ \ V ^ oj,uj r\ a = %, and w U u U {u} G A} 
= {w C [re] \ {u} I a; n (T = 0, and a; U cr U {u} G A} 
C {a; C [re] \ {u} | a; n cr = 0, and coU a G A|[„]\{^}} 
= Lk(j(A|[„]\{^}), 


which establishes that Lko.u{^}(A) C Lko-(A|[„]\{„|). Next, observe that 


cone^(Lk^u{^}(A)) \ Lk^u{^}(A) 


{{u} U w I w G Lk<^u{^}(A)} 

{{u} U w I w C [re], u ^ w, a; n CT = 0, and io U {u} U cr G A} 
{r C [re] I u G r, r n (T = 0, and r U cr G A} 

{uj G LkCT(A) I V G w}. 


Finally, 


Lka(A|[„]\{^}) 


From here the second statement is clear. 


{w G Lk(j(A) I u ^ cj}. 


□ 


Corollary 4.2. Let A be a simplicial complex on [re], a G A, and v G [re] such that v ^ a and 
a U {u} G A. If Lk^u{i;}(‘^) is contractible, then LkCT(A) and LkCT(A|[„]\|^j.) are homotopy-equivalent. 


Proof. Lemma 4.1 states that Lk^uliijC^) ^ kko-(A|[„]\|^|), and that Lko-(A) can be obtained from 


Lko-(A|[„]\{.y}) by coning off the subcomplex Lko-u{^}(A) - that is, by including cone.i,(Lko-u|^|(A)). If 
this subcomplex is itself contractible, then the homotopy type is preserved. □ 


Another useful corollary follows from the one above by simply setting A = Al^-u.^ul'!;} [re] = 
(tU rU {u}. We immediately see that if both Lko-(A|g.uT-u{.u}) and Lko.u{^,}(A|^uru{);}) contractible, 
then Lko-(A|o-ur) is contractible. Stated another way: 


Corollary 4.3. Assume v ^ a and a P t = %. If Lko-(A|o-UT) is not contractible, then either (i) 
Lko-(A|^UTU{i)}) is not contractible, and/or (ii) (tU{u} G A and Lk^u{i;}(^l(TUTU{'u}) is not contractible. 


This corollary can be extended to show that for every non-contractible link Lko-(A|o-UT)! there exists 
a non-contractible “big” link Lko-' (A) for some a' T a. This is because vertices outside of cr U r can 
be added one by one to either a or its complement, preserving the non-contractibility of the new link 
at each step. (Note that if cr U {u} ^ A, we can always add v to the complement. In this case, 
Lko-(A|CTUT) = Lko-(A|CTU.,-u{^,}), so we are in case (i) of Corollary 
following lemma: 


4.3 


In other words, we have the 
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Lemma 4.4. Let cr, r G A. Suppose u H r = 0, and Lko-(A|o-ur) is not contractible. Then there exists 
a' € A such that a' D a, a' Ci t = f!i, and Lko-'(A) is not contractible. 

The next results show that only intersections of facets (maximal faces under inclusion) can possibly 
yield non-contractible links. For any cr G A, we denote by the intersection of all facets of A 
containing a. In particular, a = fa- ii and only if a is an intersection of facets of A. It is also useful 
to observe that a simplicial complex is a cone if and only if the common intersection of all its facets is 
non-empty. (Any element of that intersection can serve as a cone point, and a cone point is necessarily 
contained in all facets.) 

Lemma 4.5. Let a € A. Then a = fa Lk(j(A) is not a cone. 

Proof. Recall that Lko-(A) is a cone if and only if all facets of Lk(j(A) have a non-empty common 
intersection u. This can happen if and only if aU u C fa. Note that since u G Lk(j(A), we must have 
n fj = 0 and hence Lko-(A) is a cone if and only if a fa- □ 

Furthermore, it is easy to see that every simplicial complex that is not a cone can in fact arise as 
the link of an intersection of facets. For any A that is not a cone, simply consider A = cone^(A); v is 
an intersection of facets of A, and Lk„(A) = A. 

The above lemma immediately implies the following corollary: 

Corollary 4.6. Let cr G A be nonempty. If fi 7 ^ fa, then Lk(j(A) is a cone and hence contractible. In 
particular, if Lko-(A) is not contractible, then a must be an intersection of facets of A (i.e., a G J-n(A)). 

Finally, we note that all pairwise intersections of facets that are not also higher-order intersections 
give rise to non-contractible links. 

Lemma 4.7. Let A be a simplicial complex. If u = ri n T 2 , where ti,T 2 are distinct facets of A, and 
a is not contained in any other facet of A, then Lko-(A) is not contractible. 

Proof. Observe that Lko-(A) consists of all subsets of = ti \ a and 0 J 2 = T 2 \ cr, but oji and 002 
are disjoint because ri and T 2 do not overlap outside of a. This means Lko-(A) has two connected 
components, and is thus not contractible. □ 

Note that if cr is a pairwise intersection of facets that is also contained in another facet, then Lko-(A) 
could be contractible. For example, the vertex 1 in Figure]^ can be expressed as a pairwise intersection 
of facets 123 and 145, but is also contained in 134. As shown in Figure the corresponding link 
Lki(A) is contractible. 


4.2 Proof of Theorem 11.31 and Lemma 11.41 


Using the above facts about links, we can now prove Theorem 1.3 and Lemma 1.4 First, we need the 
following key proposition. 


Proposition 4.8. A code C has no local obstructions if and only if cr G C for every a G A(C) such 
that Lko-(A(C)) is non-contractible. 

Proof. Let A = A(C), and let U = {Ui} be any collection of open sets such that C = C{ly(). (=i>) We 
prove the contrapositive. Suppose there exists a G A(C)\C such that Lko-(A) is non-contractible. Then 
Ua must be covered by the other sets {Ui}i^a^ aiid since LkCT(A) is not contractible, the RF relationship 
(cr, a) is a local obstruction. (-4=) We again prove the contrapositive. Suppose C has a local obstruction 
(cr, r). This means that a Pt = %, Ua fL Uier Lko-(A|o-ur) is not contractible. By Lemma 

there exists a' P a such that a' n r = 0 and Uka' (■A) is not contractible. Moreover, Ua' U Ua ’fL Uisr 
with cr' n r = 0 , which implies a' ^C. □ 


4.4 
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Theorem 1.3 now follows as a corollary of Proposition |4.8[ To see this, let 

C min (A) = {(T G A I LkCT(A) is non-contractible} U {0}, 

and note that Cmin(A) has simplicial complex A. This is because for any facet p G A, Lkp(A) = 0, 
which is non-contractible, and thus Cniin(A) contains all the facets of A. By Proposition |4.8[ any code 
C with simplicial complex A has no local obstructions precisely when C 3 Cmin(A). Thus, Cmin(A) is 


the unique code satisfying the required properties in Theorem 1.3 


Finally, it is easy to see that Lemma 1.4 follows directly from Corollary 4.6 


5 Computing mandatory codewords algebraically 

Computing Cmin(A) is certainly simpler than finding all local obstructions. However, it is still difficult 
in general because determining whether or not a simplicial complex is contractible is undecidable |16j . 
For this reason, we now consider the subset of Cmin(A) corresponding to non-contractible links that 
can be detected via homology: 

Mh{^) {(T G a I dimi/i(LkCT(A), k) > 0 for some i}, (3) 

where the Hi{-) are reduced simplicial homology groups, and k is a field. Homology groups are topo¬ 
logical invariants that can be easily computed for any simplicial complex, and reduced homology groups 
simply add a shift in the dimension of This shift is designed so that for any contractible space 

X, dimi/j(A, k) = 0 for all integers i. Clearly, M.u[X) C Cmin(A), and M.h{^) is thus a subset of 
the mandatory codewords that must be included in any convex code C with A(C) = A0 On the other 
hand, ^ C does not guarantee that C has no local obstructions, as a homologically trivial 

simplicial complex may be non-contractiblej^ 

It turns out that the entire set A4//(A) can be computed algebraically, via a minimal free resolution 
of an ideal built from A. Specifically, 

A4h(A) = {ct G a I (3i^a{S/lA*) > 0 for some i > 0}, (4) 

where S = k[xi,..., x„], the ideal Ia* is the Stanley-Reisner ideal of the Alexander dual A*, and 
/3i,aiS/lA*) are the Betti numbers of a minimal free resolution of the ring S/Ia*- This is a direct 
consequence of Hochster’s formula [9]: 

dim.Hi(Lk<^(A),k) = / 3 i+ 2 ,^{S/lA*)- (5) 

See Supplementary Text S2 for more details on Alexander duality, Hochster’s formula, and the Stanley- 
Reisner ideal. 

Moreover, the subset of mandatory codewords A4 h(A) can be easily computed using existing 
computational algebra software, such as Macaulay2 [18]. We now describe this via an explicit example. 

Example 5.1. Let A be the simplicial complex L25 in Figure]^ The Stanley-Reisner ideal is given 
by Ia = (X 1 X 2 X 4 , X 2 X 3 X 4 ), and its Alexander dual is Ia* = {xi, X 2 , X 4 ) Cl (x 2 ,X 3 ,X 4 ) = (X 1 X 3 , X 2 , X 4 ). A 

^Note that while Mh(X) depends on the choice of field k, Mh{X) C Cmin(A) for any k. 

®For example, consider a triangulation of the punctured Poincare homology sphere: this simplicial complex has all- 
vanishing reduced homology groups, but is non-contractible m- 
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minimal free resolution of S/Ia* is: 


0 ^— S/Ia* ^ 


[ X 1 X 3 X 2 X 4 ] 


s{-2)es{-i)^ ^ 


X 2 X 4 0 

—X1X3 0 X4 

0 -X 1 X 3 -X 2 


X4 

-X 2 

XIX3 


Si-4) 


The Betti number (ii^aiS/lA*) is the dimension of the module in multidegree a at step i of the resolution, 
where S/Ia* is step 0 and the steps increase as we move from left to right. At step 0 , the total degree 
is always 0. For the above resolution, the multidegrees at «S'(—2) © S'(—1)^ (step 1) are 1010, 0100, and 
0001; at 5(—3)^©S'(—2) (step 2), we have 1110, 1011, and 0101; and at £'(—4) (step 4) the multidegree 
is 1111. This immediately gives us the nonzero Betti numbers; 

/3o,oooo(£/-^a*) = 1, /3i,ioio(£/.^a*) = 1, /3i,oioo(£/-^a*) = 1, /5i,oooi(£/-^a*) = 1, 
/32,111o(£/-^A*) = 1, /32,101i(£/.^A*) = 1, /32,010i(£/-^A*) = 1, /53,llll('S'/.fA*) = 1- 

Recalling from equation Q that the multidegrees correspond to complements a of faces in A, we can 
now immediately read off the elements of A4//(A) from the above for i > 0 as: 

Mh{^) = {0101,1011,1110,0001, 0100,1010, 0000} = {24,134,123,4, 2,13, 0}. 


Note that the first three elements of A4 h(A) above, obtained from the Betti numbers /3i,* in step 

1 of the resolution, are precisely the facets of A. The next three elements, 0001,0100, and 1010, are 
mandatory codewords: they must be included for a code with simplicial complex A to be convex. These 
all correspond to pairwise intersections of facets, and are obtained from the Betti numbers / 32 ,* at step 

2 of the resolution; this is consistent with the fact that the corresponding links are all disconnected, 
resulting in non-trivial //o(Lk(T(A), k). The last element, 0000, reflects the fact that Lk 0 (A) = A, and 
dim//i(A,k) = 1 for A = L25. By convention, however, we always include the all-zeros codeword in 
our codes (see Section 1.2). 

Using Macaulay2 m, the Betti numbers for the simplicial complex A above can be computed 
through the following sequence of commands (choosing k = Z 2 , and suppressing outputs except for the 
Betti tally at the end): 


11 

12 

13 

14 

15 

16 
17 


kk = ZZ/2; 

S = kk[xl,x2,x3,x4, Degrees => {{1,0,0,0},{0,1,0,0},{0,0,1>0},{0,0,0,1}}]; 

I = niononilalIdeal(xl*x2*x4,x2*x3*x4); 

Istar = dual(I); 

M = S'l/Istar; 

Mres = res M; [comment: this step computes the minimal free resolution] 

peek bettl Mres 


(0, 

{0, 

0, 

0, 

0}, 

0) 

=> 

1} 

(1. 

{0, 

0, 

0, 

1}. 

1) 

=> 

1 

(1. 

{0, 

1, 

0, 

0}. 

1) 

=> 

1 

(1. 

{1, 

0, 

1, 

0}. 

2) 

=> 

1 

(2, 

{0, 

1, 

0, 

1}. 

2) 

=> 

1 

(2, 

{1, 

0, 

1, 

1}. 

3) 

=> 

1 

(2, 

{1, 

1, 

1, 

0}, 

3) 

=> 

1 

(3, 

{1, 

1, 

1, 

1}. 

4) 

=> 

1 
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Each line of the BettiTally displays (i, {fi}, |cj|) ^ This yields (in order): 

/3o,0000 = 1) /5l,0001 = 1) /? 1,0100 = 1) A ,1010 = 1) /32,0101 = 1) = 1, /32,1110 = 1) /^spiii = 1; 

which is the same set of nonzero Betti numbers we previously obtained. Recalling again that the 
multidegrees correspond to complements d in Q, and we care only about i > 0, this output immediately 
gives us - exactly as before. 


The above example illustrates how computational algebra can help us to determine whether a code 
has local obstructions. However, as noted in Section even codes without local obstructions may 
fail to be convex. Though we have made significant progress via Theorem 1.3 
combinatorial characterization of convex codes is still an open problem. 


finding a complete 
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SI Classification of convex codes for n < 4 


For n = 1 
convention 
codes on 3 


or n = 2, all codes are convex. The first non-convex codes appear for n = 3. Using our 
that all codes include the all-zeros codeword, there are a total of 40 permutation-inequivalent 
neurons [T]. Of these, only 6 are non-convex (see Tableand Lemma 6.1). 


label 

code C 

A(C) 

B3 

000, 010, 001, 110, 101 

L 6 

B5 

000, 010, 110, 101 

L 6 

B 6 

000, 110, 101 

L 6 

E2 

000,100, 010,110,101, on 

L7 

E3 

000,100, no, 101, on 

L7 

E4 

000, no, 101, on 

L7 


Table 1: All non-convex codes on n = 3 neurons, up to permutation equivalence. Code labels are the 
same as in [T], and simplicial complex labels are as in Figures]^ and 


Lemma 6.1. There are 6 non-convex codes on n < 3 neurons, up to permutation equivalence. They 
are the codes shown in Table [TJ 


Proof. First, we show that all six codes in Table are not convex. Let Al 6 and Alt be the sim¬ 
plicial complexes labeled L 6 and L7 in Figures and It is easy to see that {1} G Cmin(AL 6 ) and 
{1},{2},{3} G C„,in (Alt), because the corresponding links are non-contractible. Codes B3, B5, and 
B 6 all have simplicial complex Al 6 , but are missing the codeword 100, corresponding to {!}. Since 
{1} G Cmin(AL6)) by Theorem 1.3 these codes each have a local obstruction and thus cannot be convex. 
Codes E2, E3, and E4 all have simplicial complex Alt, but are missing the codeword 001, corresponding 
to {3}. Since {3} G Cmin(ALT)) these codes cannot be convex. All remaining codes for n = 3 neurons 
were shown to be convex in [T], via explicit convex realizations in two dimensions. □ 


We now consider n < 4. Eigure displays all simplicial complexes on n < 4 vertices, up to 
permutation equivalence, and highlights all intersections of two or more facets. By inspection, we see 
that every link corresponding to a non-empty max intersection is not contractible. We thus have the 
following lemma: 


Lemma 6.2. Let C be a neural code on n < 4 neurons. Then C has no local obstructions if and only 
if C is max H-complete. 


Proof. Let A = A(C) be the simplicial complex on n < 4 vertices corresponding to the code C. Recall 
from Theorem |1.3| that Cniin(A) consists of all a G A such that Lko-(A) is not contractible. Since, 
for n < 4, all non-empty max intersections a G J-n(^) have non-contractible links, it follows that 
Cmin(A) = T'n(^)- Therefore, by Theorem 1.3, C has no local obstructions if and only if C 5 T'n(':^)- ^ 
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Note that the proof of Lemma 6.2 relied on showing that Cniin(^) = More generally, for any 

code C whose simplicial complex A satisfies Cmin(A) = we know that C has no local obstructions 


if and only if it is max H-complete. Combining this with Proposition 1.8, we have the following: 


Proposition 6.3. Let A be a simplicial complex satisfying 
A(C) = A, C is convex if and only if it is max H-complete. 


i(A) = Then for any C with 


As a corollary of Lemma 6.2 and Proposition |6.3[ we obtain Proposition 1.7 


In fact, for n < 4 we have generated explicit convex realizations for all max H-complete codes. 
Figure [7] illustrates convex realizations for the max H-complete codes corresponding to most of the 
simplicial complexes L1-L28. The max H-complete codes for the omitted complexes L1-L5, L9-L10, 
and L12 all have obvious convex realizations, while those for L15 and L16 are obvious given the 
realizations for L7 and L8, respectively. 

Table summarizes our classification of n < 4 codes and provides minimal embedding dimensions 
d(C) for all the convex codes. Note that d{C) sometimes varies for codes with the same simplicial 
complex, depending on which optional codewords are included. 
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L10 

1 

• 

d = 1 

L2° 

•2 

d = l 

L3° 

II 

"D 

•-• 

L4° 1 

• 

2* *3 

d = 1 

L5° ^ 

/., 

d = l 

L6 , 

A 

d = 1 

L7+ 1 

A 

d = 2 

L8° 

A 

d = l,2 

L9° ^ 

• 

• 3 

d = l 

L10° ^ 

/ ■■ 

4 

d = 1 

L11 ^ 

/ 

*4 

d = l 

L12° 

A’ 

d = 1 

II 

A" 

L14 ^ 

■A" 

d = 1,2 

L15 + 

/ 

*4 

d = 2 

LI 6° 

A 

d = 1,2 

L17 1 

d = 2 

L18 1 

A' 

d = l,2 

L19 + 

A 

d = 2 

L20+ 1 

A 

d = 2 

L21 1 

d = 2 

L22 1 

d = l,2 

L23+ 1 

A 

d = 2 

L24 1 

A' 

d = 2, 3 

L25 1 

d = 2 

L26 1 

A 

d = 2 

L27+ 1 

^4 

d = 3 

L28° 1 

d = l,2 


Figure 6: Classification of convex codes for n < 4. L1-L28 are the 28 possible simplicial complexes A = A(C) that 
can arise, up to permutation equivalence. For each simplicial complex, faces highlighted in color correspond to 
pairwise (red) and triple (orange) intersections of facets (maximal faces). One can check that for each colored face 
cr, Lkcr(A) is not contractible. By Proposition |1.7[ any code C with simplicial complex A is convex if and only if C 
includes all codewords corresponding to the colored faces. All other non-maximal faces are optional codewords, 
whose inclusion or exclusion does not affect convexity. Simplicial complexes with no optional codewords are 
labeled +, while those with no (non-maximal) mandatory codewords are labeled o. When both -|- and o labels 
apply, as in LI, L2, and L4, we simply use o. The possible minimal embedding dimensions d = d(C) that can 
arise for convex codes are shown in the lower right corners. 


19 















A 

# non- 

convex 

codes 

maximal 

codewords 

(facets) 

mandatory 

codewords 

(non-maximal) 

optional 
codewords 
in A 

d(C) 

picture 

notes 

Ll° 

0 

1 

none 

none 

1 



L2° 

0 

10, 01 

none 

none 

1 



L3° 

0 

11 

none 

all (2) 

1 



L4° 

0 

100, 010, 001 

none 

none 

1 



L5° 

0 

110, 001 

none 

all (2) 

1 



L6 

3 

110, 101 

100 

010, 001 

1 

L6-series 


L7+ 

3 

110 , 101 , on 

all (3) 

none 

2 

L7-series 

convex C = A(C) 

L8° 

0 

111 

none 

all (6) 

1, 2 

L8-series 

d = 2 ★ 

L9° 

0 

1000, 0100 

0010, 0001 

none 

none 

1 



L10° 

0 

1100, 0010, 0001 

none 

all (2) 

1 



Lll 

3 

1100, 1010, 0001 

1000 

0100, 0010 

1 

L6-series 


L12° 

0 

1100, 0011 

none 

all (4) 

1 



L13 

7 

1100, 1010, 0011 

1000, 0010 

1000, 0001 

1 

L6-series 


L14 

4 

1100, 1010, 1001 

1000 

0100, 0010, 0001 

1, 2 

L6-series 

d = 2 -k 

L15+ 

3 

1100, 1010 

0110, 0001 

all (3) 

none 

2 

like L7 

convex C = A(C) 

L16° 

0 

1110, 0001 

none 

all (6) 

1, 2 

like L8 

d = 2 -k 

L17 

10 

1100, 1010 

0110, 0101 

1000, 0100 

0010 

0001 

2 

L7-series 


L18 

40 

1110, 0101 

0100 

1000, 0010, 0001 

1100, 1010, 0110 

1, 2 

L8-series 

d = 2 C contains 
{1000,0010}, or 
{1100,0110,1010} 

L19+ 

5 

1100, 1010 

0101, 0011 

all (4) 

none 

2 

L6-series 

convex C = A(C) 

L20+ 

8 

1100, 1010, 1001 
0110, 0011 

all (4) 

none 

2 

L7-series 

convex C = A(C) 

L21 

68 

1110, 0101, 0011 

0100, 0010 

0001 

1000, 1100 

1010, 0110 

2 

L8-series 


L22 

62 

1110, 0111 

0110 

1000, 0100, 0010 

0001, 1100, 1010 

0101, 0011 

1, 2 

L22-series 

d = 2 4^ k, 
or {0101,0011} CC, 
or {1100,1010} cC 

L23+ 

4 

1100, 1010, 1001 

0110, 0101, 0011 

all (4) 

none 

2 

L7-series 

convex C = A(C) 

L24 

36 

1110, 1001 

0101, 0011 

1000, 0100 

0010, 0001 

1100, 1010, 0110 

2, 3 

L8-series 
d=3 codes 

d = 3 no 

optional codewords 

L25 

168 

1110, 1011, 0101 

0100, 0001 

1010 

1000, 0010, 1100 

1001, 0110, 0011 

2 

L22-series 


L26 

407 

1110, 1011, 0111 

0010, 1010 

0110, 0011 

1000, 0100, 0001 

1100, 1001, 0101 

2 

L22-series 


L27+ 

85 

1110, 1101 

1011, 0111 

all (10) 

none 

3 

d=3 codes 

convex C = A(C) 

L28° 

0 

nil 

none 

all (14) 

1, 2 

L22-series 

d = 2 ★ 


Table 2: Convexity and dimension for codes on n < 4 neurons. For each simplicial complex A, labeled as 
in Figure]^ the second column is the number of non-convex codes C such that A(C) = A, up to permutation 
equivalence and including the all-zeros codeword, while the sixth column d{C) displays the possible minimal 
embedding dimensions for convex codes only. The third column lists the codewords corresponding to facets of 
A; these are automatically included in any code with simplicial complex A. The fourth columns gives all other 
non-empty mandatory codewords - that is, elements of Cmin(A) that are not facets of A. Optional codewords are 
elements of A whose presence or absence does not affect whether or not a code is convex, though they may alter 
the minimal embedding dimension d(C). When all non-maximal codewords are mandatory or all are optional, 
their total number is given in parentheses. The picture column indicates the groupings used for the convex 
realizations in Figure In the notes column, * indicates that the set of optional codewords in C can not form a 
2-chain. A collection of codewords forms a chain if we can completely order the respective sets by containment 
- so {1111,1100,1000} is a chain, but {1110,1000,1101} is not. A collection of codewords can form a 2-chain if 
it can be partitioned into two sets (possibly empty) which are both chains. -I- and o are the same as in Figure]^ 
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L6 - series 

L6 



L22 - series 



d=3 codes 



Figure 7: Convex realizations for codes on n < 4 neurons. Each convex set Ui, for neuron i, is 
the union of all regions corresponding to codewords containing i (see also Figure]^. Note that each 
picture displays regions corresponding to mandatory codewords in various shades of gray, while optional 
codewords are in blue. A single picture thus shows convex realizations for all convex codes corresponding 
to the same simplicial complex, as blue regions can be included or excluded without affecting convexity. 
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S2 Alexander duality, the Stanley-Reisner ideal, and Hochster’s formula 

For any simplicial complex A on vertex set [n], the Alexander dual is the related simplicial complex: 

A* {r I r ^ A}, 


where r = [n] \ r denotes the complement of r in [n]. Note that (A*)* = A. Any A also has an 
associated ideal known as the Stanley-Reisner ideal: 

fA I n I ^ ^ 

liSo- 



Alexander duality relates the reduced homology of a simplicial complex to the cohomology of its 
Alexander dual, 


Ri(A*,k) ^ 


where k is a field. Meanwhile, Hochster’s formula relates the nonzero Betti numbers from a minimal 
free resolution of the Stanley-Reisner ideal to the reduced cohomology of restricted simplicial complexes 
[9]. Specifically, 

A-i,.(/a) = PiAS/lA) = dimRH-*-i(A|,,k), ( 6 ) 

where S = k[xi,..., Xn] and the refer always to Betti numbers for a minimal free resolutionj^ 
The following link lemma connects Hochster’s formula Q to its dual version, below. The dual 
formulation is more useful to us, as it allows us to compute the dimensions of all non-trivial homology 
groups for all links, LkCT(A), from a single free resolution. 


Lemma 6.4. Lko-(A) = (A*|ct)*. 


Proof. First, observe that /S*\a = {t \ t (Z a and r G A*}. The dual is thus (A^l^)* = {it \ r | r C 
a and r 0 A*} = {uj \ oj Z a and (t\uj ^ A*} = {a; | cj n cr = 0 and a U a; G A} = Lko-(A). □ 

Lemma 6.5 (Hochster’s formula, dual version, cf. [9l Corollary 1.40]). 


dimHi(Lk<^(A),k) = Pi+ 2 ,a{S /1^*). 


Proof. Using (in order) Lemma |6.4[ Alexander duality, and the original version of Hochster’s for¬ 
mula Q, we obtain: dimHj(Lko-(A), k) = dimHj((A*|CT)*, k) = dimiLl'’"l“*“^(A*| 5 -, k) = (3i+2,aiS/lA*)- 


□ 


It is important to note that if cr is a facet of A, then Lk(j(A) = 0, which is non-contractible due 
to nontrivial homology in degree —1. Hochster’s formula thus detects facets of A via the nonzero 
Betti numbers A,a, as these correspond to a such that dimH_i(Lko-(A), k) > 0. Note also that 
Lk 0 (A) = A, so if A itself has nontrivial reduced homology in degree i, this will be detected as a 
nonzero Betti number / 3 j+ 2 ,[n]) where d = [n] is the complement of cr = 0. 


S3 Bounds on the minimal embedding dimension of convex codes 

We now turn to the problem of determining the minimal embedding dimension d{C) of a convex code C. 
There is no general method for computing d{C), though bounds can be obtained from the information 
present in the simplicial complex A(C). In this section, we review known results on d-representability 
and Helly’s theorem, and apply them to obtain lower bounds on d{C). We also obtain an additional 
bound on d{C) from the Fractional Helly theorem, and examine how it compares to the Helly’s theorem 
bound. 

'’See Chapter 1] for more details about free resolutions of monomial ideals. 
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Our dimension bounds all rely solely on features of the code captured by A(C), and do not take 
into account the finer structure of the code. Nevertheless, the presence or absence of a single codeword 
can have a significant effect on d{C), even if the simplicial complex A = A(C) is hxed (e.g., see Table 
for L 8 , L14, L16, etc.). It remains an open question how to use this additional information in order to 
improve the bounds on d{C). 

Embedding dimension and d-representability 

The problem of determining d(C) for a convex code C has not been directly addressed in the literature. 
However, the related problem of determining when a simplicial complex A can be realized as the nerve 
N{U) of a cover lA has received considerable attention (see |13LI19| and references therein). A simplicial 
complex A is said to be d-representable if there exists a collection of convex (not necessarily open) sets 
U = {Ui, ..., Un}, with Ui C such that A = J\f{U). Note that for such a U, the corresponding code 
C{U) need not be equal to A, though it is always true that iS.{C{U)) = A. 

While d-representability of A(C) does not tell us the value of d{C), it does provide a lower bound. 
This motivates us to define the nerve dimension dj\f{C) of a code C to be the minimal d such that A(C) 
is d-representable. It immediately follows that 

d{C) > d^iC), 

because any embedding of C in via a collection of convex open sets U is also a realization of A(C) 
as the nerve MiU). Unfortunately, dj^{C) may be difficult to compute in general. In contrast, we can 
obtain a lower bound from Helly’s theorem that is simple to read off from the structure of A(C). 


Bounds from Helly’s theorem 

One common tool used to address d-representability of simplicial complexes is Helly’s theorem. 

Lemma 6.6 (Helly’s theorem [20]). Let lA = {C/i,..., Un} be a collection of convex open sets in M'’*. 
If for every d -|- 1 sets in lA, the intersection is non-empty, then the full intersection nr=i 
non-empty. 


Helly’s theorem implies that if A is d-representable and A contains all possible d-dimensional faces, 
then A must be the full simplex. On the other hand, if A contains all possible d-dimensional faces 
but is not the full simplex, then it is not d-representable. This immediately yields examples where the 
presence or absence of a single codeword can have a large effect on d(C). 

Proposition 6.7. Let C be a code on n neurons, and suppose that for some k with 1 < k < n, A(C) 
contains all /c-dimensional faces. If 11 • • • 1 G C, then d(C) < 2; otherwise, d(C) > k. 


Proof. 
Lemma 2.5 


In the first case, where 11 • • • 1 G C 
For the second case, where 11 •• 


the fact that C is convex and d(C) < 2 follows from 
1 ^ C, suppose C is realizable as a convex code in for 
some d < k, so that C = CiU) for some collection of convex open sets lA = {Ui,U 2 , ■ ■ ■ ,Un}, with each 
Ui C Since, by hypothesis, A(C) contains all A;-dimensional faces, it also contains all d-dimensional 
faces, and so the intersection of every collection of d -|- 1 subsets in lA is non-empty. Thus, by Helly’s 
Theorem, the full intersection of all sets in lA is non-empty, and so 11 • • • 1 G C. This contradicts the 
fact that 11 • • • 1 ^ C; hence, we must have d{C) > k. □ 


We can also apply Helly’s theorem to every subcollection {Uq,..., C lA, or equivalently 

to the induced subcomplex on elements ii,... ,im, to see that if all the d-dimensional faces of this 
subcomplex are present, then the top-dimensional face must also be present in order for A to be d- 
representable. This leads us to the following definitions. A simplicial complex is said to contain an 
induced k-dimensional simplicial hole if it contains k + 1 vertices such that the induced subcomplex 
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on those vertices is isomorphic to a hollow simplex (see Section 1.2, and m)- We define the Helly 
dimensior^ of C, denoted dniC), to be the dimension of the largest induced simplicial hole of A(C): 


def 


dniC) = max{fc | A(C) has a fc-dimensional induced simplicial hole}. 
Clearly, dniC) < d{C). 


(7) 


Bounds from the Fractional Helly theorem 

The Fractional Helly theorem is a well-known extension of Helly’s theorem that provides new bounds 
on d{C), though they are not always better. 

Lemma 6.8 (Fractional Helly theoren0 Theorem 6.7 of [l3]). Let a > 0, and U = {Ui,U 2 , ■ ■ ■ ,Un} 
be a collection of convex open sets in M'^uch that at least a(^”i) of the (dT l)-tuples of sets in U have 
non-empty intersections. Then there exists a C [n] such that |fT| > -;^n, and Hieo- non-empty. 

The Fractional Helly theorem indicates that if a code C can be embedded in and the simplicial 
complex A(C) has many d-dimensional faces, then A(C) must have some sufficiently high-dimensional 
face. The following lemma quantifies these observations in our context. 

Lemma 6.9. Let A be a ^-dimensional simplicial complex on n elements, and let /d(A) be the number 
of d-dimensional faces in A for 1 < d < n. If A is d-representable, then k + 1 > 

Proof. By definition of d-representable, we have A = J\f{U) for some U = {Ui, U 2 , ■ ■ ■, Un}, where each 
Ui C Since each d-dimensional face of A corresponds to an intersection of (d -|- 1) of the UiS, 
we have that /rf(A(C)) of the (d -|- l)-tuples have non-empty intersections. By the Fractional Helly 

where a = 7^/ . Since A(C) 

is /c-dimensional, it follows that {al < k + 1, and so k + 1 > = /d(A(C))/D 

This leads us to the following definition. Let C be a code on n neurons with a /c-dimensional 
simplicial complex A(C), and let fd{A{C)) be the number of d-dimensional faces in A(C) for 1 < d < n. 
The Fractional Helly dimension dpH^C) of C is given by: 

, 1 < d < n| . ( 8 ) 

Further dimension bounds based on the /-vector {/j(A)} can be obtained from the results in [221I23| . 


dFH(C) 1 -hmax} d fd{A{C))>{k + l) 


n — 1 


theorem, there is some a C [n] with |cj| > such that Hieo- 


Comparison of dimension bounds 


How do the Helly and Fractional Helly dimensions dniC) and dpuiC) compare to each other and to 
the nerve dimension djy{C)l First, note that although a simplicial complex A cannot be represented in 
any dimension less than its Helly dimension d//(A), Helly’s theorem does not guarantee that A is dn- 
representable. Thus we have dniC) < dj\f{C) < d(C). By the same reasoning, dpuiC) < djy{C) < d{C). 
The next examples show that we can have dniC) < dj\f{C) and dpH{C) < dj\f{C)] i.e., the nerve 
dimension can provide a strictly stronger lower bound. 


°The closely-related notion of Helly number for a simplicial complex was previously introduced in the literature. 
Specifically, the Helly number of A(C) is drr(C) + 1. 

®In [2] I Kalai gave a sharp version of Theorem 


6.8 


If of the {d+ l)-tuples have non-empty intersections, then 

there exists a with |(t| > (1 — (1 — such that the corresponding intersection is non-empty. The precise value 

of P for which |(t| > Pn is not essential to the results in the remainder of our paper. Therefore, for ease of notation and 
computation, we prefer /3 = a/(d+ 1) as in Theorem 6.8 while keeping in mind that a tighter bound exists. 
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a 


b 


1 2 



4 3 



Figure 8: (a) A simplicial complex A, with facets 125, 235, 345, 145. (b) A convex realization of A 

in 


Example 6.10. Consider any code C such that A(C) is the simplicial complex in Figure]^. We obtain 
duiC) = 1, because the two maximal induced simplicial holes of A(C) arise from the subsets {1,3} 
and {2,4}, which both have dimension 1. Although A(C) is contractible, the induced subcomplex on 
{1,2,3,4} is a 1-cycle, so A(C) is at best 2-representable and dj\f{C) > 2. Thus, dj\f{C) > dniC). 
Figure]^ shows that the minimal embedding dimensions is, in fact, d{C) = 2. 

Our final example shows not only that we can have dpuiC) < dj\f{C), but also that it is possible 
for dpniC) < dniC) or for dniC) < dpHiC)-, depending on the code. So although dj\f{C) is always the 
strongest of the three bounds, neither of the easier-to-compute dniC) and dpuiC) bounds is universally 
stronger than the other. In other words, all we can say in general is that the minimal embedding 
dimension satisfies: 

d{C) > dM{C) > iaayi{dH{C),dFH{C)]. (9) 


a b 



Figure 9: (a) The simplicial complex of code Ci from Example 6.11[ 

code C 2 from Example |6.1H for r = 4. 


(b) The simplicial complex of 


Example 6.11. (a) Let n = 3, and consider the code Ci = {000,110,101,011}, whose simplicial 
complex A(Ci) is the empty triangle, as shown in Figure]^. This is a hollow 2-simplex, so we have 
dniCi) = 2 and dj^{Ci) = 2. In contrast, a quick computation yields dpuiCi) = 1, and so dniCi) = 
dj\f{Ci) > dpHiCi). 

(b) Let n = 2r, where r > 4, and suppose A(C2) = Kr^r, the complete bipartite graph on 2r vertices, 
as shown in Figure]^. This graph contains no triangles, so the largest induced simplicial holes result 
from missing edges in A(C2), which have dimension 1. Thus, d_ff(C2) = 1. To compute dFH{C 2 ), we 
first find the /-vector of A(C2). Observe that /o(A(C2)) = n, /i(A(C2)) = r^, and /i(A(C'2)) = 0 for 
2 < i < n. Note also that k = 1, since A(C2) is 1-dimensional. Since /rf(A(C2)) = 0 for d > 2, the 
inequality /d(A(C2)) > (A: -|- 1) • is not satisfied for these values of d. However, we have 

/i(A(C 2)) = r2 > 2(2r - 1) = 2(n - 1) = (A: + 1) • ^ , 

with the inequality being valid for all r > 4. Thus, directly from the definition, we find dFH(C 2 ) = 2. 
Hence, dFH{C 2 ) > dH{C 2 )- 
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